EvalIta 2011: The Frame Labeling over Italian Texts Task
نویسندگان
چکیده
The FLaIT task held within the EvalIta 2011 challenge is here described. Systems were asked to label semantic frames and their arguments as evoked by input predicate words over plain text sentences. Proposed systems are based on a variety of learning techniques and achieve very good results, over 80% of accuracy, in most subtasks. 1 The Frame Labeling Task In the “Frame Labeling over Italian Texts” (FLaIT) task, the general goal is to come forward with representation models, inductive algorithms and inference methods which address the Semantic Role Labeling (SRL) problem. This is the first time that such a task is proposed in the framework of the EVALITA campaign. So far, a number of shared tasks (CoNLL–2004, 2005, 2008, 2009 and Senseval/Semeval–2004, 2007, 2010) have been concerned with SRL. Typically, two main English corpora with semantic annotations from which to train SRL systems have been used: PropBank [3] and FrameNet [1]. These previous experiences have been focused on developing SRL systems based on partial parsing information and/or increasing the amount of syntactic and semantic input information, aiming to boost the performance of machine learning systems on the SRL task. Since 2009, CoNLL has been accompanied by a shared task dedicated to SRL not restricted to a monolingual setting (i.e. English) [2]. The Evalita 2011 FLaIT challenge is the first evaluation exercise for the Italian language, focusing on different aspects of the SRL task over Italian texts. The interest in organizing this challenge has been prompted by the recent development of FrameNet–like resources for Italian that are currently under development in the iFrame project. 1 http://verbs.colorado.edu/∼mpalmer/projects/ace.html 2 https://framenet.icsi.berkeley.edu/fndrupal/ 3 http://sag.art.uniroma2.it/iframe/doku.php
منابع مشابه
Structured Kernel-Based Learning for the Frame Labeling over Italian Texts
In this paper two systems participating to the Evalita Frame Labeling over Italian Texts challenge are presented. The first one, i.e. the SVM-SPTK system, implements the Smoothed Partial Tree Kernel that models semantic roles by implicitly combining syntactic and lexical information of annotated examples. The second one, i.e. the SVM-HMM system, realizes a flexible approach based on the Markovi...
متن کاملThe AnIta-Lemmatiser: A Tool for Accurate Lemmatisation of Italian Texts
This paper presents the AnIta-Lemmatiser, an automatic tool to lemmatise Italian texts. It is based on a powerful morphological analyser enriched with a large lexicon and some heuristic techniques to select the most appropriate lemma among those that can be morphologically associated to an ambiguous wordform. The heuristics are essentially based on the frequency-of-use tags provided by the De M...
متن کاملGenerative and Discriminative Learning in Semantic Role Labeling for Italian
In this paper, we present a Semantic Role Labeling tool for Italian language for the FLaIT competition at Evalita 2011. This tool presents an hybrid approach to resolve the different sub-tasks that composed the SRL task. We apply a discriminative model for the boundary detection task based on lexical and syntactical features. A distributional approach to modeling lexical semantic information, i...
متن کاملNLP-NITMZ: Part-of-Speech Tagging on Italian Social Media Text using Hidden Markov Model
English. This paper describes our approach on Part-of-Speech tagging for Italian Social Media Texts (PoSTWITA), which is one of the task of EVALITA 2016 campaign. EVALITA is a evaluation campaign, where teams are participated and submit their systems towards the developing of tools related to Natural Language Processing (NLP) and Speech for Italian language. Our team NLP–NITMZ participated in t...
متن کاملbot.zen @ EVALITA 2016 - A minimally-deep learning PoS-tagger (trained for Italian Tweets)
English. This article describes the system that participated in the POS tagging for Italian Social Media Texts (PoSTWITA) task of the 5th periodic evaluation campaign of Natural Language Processing (NLP) and speech tools for the Italian language EVALITA 2016. The work is a continuation of Stemle (2016) with minor modifications to the system and different data sets. It combines a small assertion...
متن کامل